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ABSTRACT 

During human immunodeficiency virus type 1 (HIV- 
1) maturation, three different forms of nucleocapsid 
(NO) protein— NCp1 5 (p9 + p6), NCp9 (p7 + SP2) and 
NCp7 — appear successively. A mutant virus express- 
ing NCp15 shows greatly reduced infectivity. Mature 
NCp7 is a chaperone protein that facilitates remodel- 
ing of nucleic acids (NAs) during reverse transcrip- 
tion. To understand the strict requirement for NCp15 
processing, we compared the chaperone function of 
the three forms of NC. NCp15 anneals tRNA to the 
primer-binding site at a similar rate as NCp7, whereas 
NCp9 is the most efficient annealing protein. As- 
says to measure NA destabilization show a similar 
trend. Dynamic light scattering studies reveal that 
NCp15 forms much smaller aggregates relative to 
those formed by NCp7 and NCp9. Nuclear magnetic 
resonance studies suggest that the acidic p6 domain 
of HIV-1 NCp15 folds back and interacts with the ba- 
sic zinc fingers. Neutralizing the acidic residues in 
p6 improves the annealing and aggregation activ- 
ity of NCp15 to the level of NCp9 and increases the 
protein-NA aggregate size. Slower NCp15 dissoci- 
ation kinetics is observed by single-molecule DNA 
stretching, consistent with the formation of electro- 
static inter-protein contacts, which likely contribute 
to the distinct aggregate morphology, irregular HIV-1 
core formation and non-infectious virus. 



INTRODUCTION 

Human immunodeficiency virus type 1 (HIV-1) nucleocap- 
sid protein (NC or NCp7) is a small, 55-residue protein that 
contains two C-C-H-C-type zinc fingers (ZFs) and a basic 
N- terminal domain (NTD) (1) (Figure lA). It plays numer- 
ous critical roles in the HIV-1 life cycle, such as facihtat- 
ing viral assembly (2-5), genomic ribonucleic acid (RNA) 
packaging (6-9), RNA dimerization (10-13), reverse tran- 
scription (14-17) and integration (18-23). NC is also a nu- 
cleic acid (NA) chaperone protein that destabilizes stable 
NA structures and refolds them to a thermodynamically 
more favored state (14,16,17,24,25). HIV-1 NC's chaper- 
one function is essential during reverse transcription of the 
highly structured viral RNA genome (gRNA) (26). For ex- 
ample, as a domain of the HIV-1 Gag precursor, HIV-1 
NC is involved in the placement of the tRNA^^^^ primer 
onto the gRNA primer binding site (PBS) (27-29). Ma- 
ture NC also plays roles in minus- and plus-strand transfer 
(16,17,30). Mutation of intact NC greatly reduces viral in- 
fectivity (7,31,32) and thus NC is a potential target protein 
for the development of new antiretroviral agents (17,33-36). 

NC is first synthesized as a domain of HIV-1 Gag. Dur- 
ing viral maturation, HIV-1 protease cleaves Gag into ma- 
trix (MA), capsid (CA), NCp7, p6 and two small peptides, 
spacer peptides 1 and 2 [SPl and SP2, also referred to as p2 
and p 1 , respectively (37)] . The cleavage reactions occur step- 
wise (38^0) in a highly regulated manner (37,41). Three dif- 
ferent forms of NC appear sequentially during this process 
(Figure IB). HIV-1 protease first cleaves after SPl, generat- 
ing a form of NC called NCpl5, which consists of mature 
NCp7 linked to SP2 and p6. Further processing of NCpl5 
results in NCp9 (NCp7 + SP2), and finally cleavage between 
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NCp7 and SP2 results in release of mature NCp7 (Figure 
IB). NCpl5 and NCp9 appear only transiently but their 
lifetime is specifically programmed (41). Additional cleav- 
age sites within NCp7 have been proposed based on in vitro 
peptide cleavage experiments (42), but NCpl5, NCp9 and 
NCp7 are the three major forms that have been observed 
by the numerous studies mentioned thus far on Gag and 
NC precursor cleavage processes. Although all three forms 
of NC have an overall positive charge (p/ = 9.50-10.15), 
the C-terminal p6 domain of NCpl5 contains eight acidic 
residues and has a p/of 4.50 (Figure lA). 

Previous studies showed that mutation of the protease 
cleavage sites proximal to the NC domain will affect HIV-1 
infectivity. The correct processing of the cleavage site be- 
tween SP2 and p6 is critical for the virus and virus con- 
taining NCpl5 is non-infectious (43-46). However, there is 
some debate regarding the effect of mutating the NCp7-SP2 
cleavage site (expressing NCp9); some studies reported that 
mutations at this site lead to non-infectious virus (44,47), 
while in other studies a virus with NCp9 retains some in- 
fectivity (43,45,46,48). The different mutations introduced 
to block the processing of this cleavage site generate HIV-1 
variants that likely have variable effects on other critical vi- 
ral replication processes (e.g., assembly, core condensation, 
reverse transcription, integration, etc.) In addition, the ex- 
periments were performed in the context of different HIV- 1 
strains, which has been reported to lead to different out- 
comes (44,49). Therefore, it is difficult to determine the ef- 
fect of authentic NCp9 and NCpl5 on HIV-1 replication 
using cell culture assays. 

HIV-1 NCpl5 bears some resemblance to HTLV-1 NC, 
which has both a basic NTD containing the two ZF motifs 
and an acidic C-terminal domain (CTD). A previous study 
showed that the acidic CTD of HTLV-1 NC negatively reg- 
ulates its chaperone activity (50). Based on a fluorescence 
study, it was suggested that the CTD folds back and in- 
teracts with the ZF domain (51). Additional biochemical 
studies suggested that the basic NTD of HTLV-1 NC inter- 
acts with the CTD either intramolecularly or intermolecu- 
larly, in the absence or presence of NAs, respectively (50). 
Whether a similar interaction regulates the chaperone ac- 
tivity of HIV-1 NCpl5 is unknown. 

The p6 domain of Gag interacts with several cellular and 
viral proteins (52), such as human proteins TsglOl (53,54) 
and AIP/ALIX (55,56), and the viral accessory protein Vpr 
(57-59). TsglOl and AIP/ALIX play roles during HIV-1 
budding (53,54,56). Vpr has several functions during HIV- 
1 infection and is required for efficient viral replication in 
non-dividing cells (60-62). A significant amount of Vpr is 
also packaged into virions. Interestingly, it has been shown 
that p6 is excluded from the capsid core after HIV- 1 matu- 
ration, while NC and Vpr are both located inside the core 
structure (63,64). It is possible that NCpl5 can interact with 
these cellular and viral factors through p6, and these inter- 
actions could affect NCpl5's chaperone function in vivo. Al- 
ternatively, the trafficking of the reverse transcription com- 
plex after HIV-1 cell entry may be negatively impacted by 
p6 fusion to NCp7/SP2. 

In this study, we used biochemical and biophysical ap- 
proaches to characterize and compare the NA binding and 
chaperone properties of mature NCp7 to those of authen- 



tic precursor proteins NCp9 and NCpl5 that are only tran- 
siently found in virions (41). We also carried out cell-based 
assays to probe the potential roles of TsglOl and Vpr inter- 
actions on the negative effect of NCpl5 on virus infectivity. 
Our study supports the role of inter-protein contacts and 
abnormal protein-NA aggregate formation in the defects 
observed in NCpl5-containing HIV-1. 

MATERIALS AND METHODS 

Preparation of proteins and NAs 

The HIV-1 NC proteins and proviral plasmids used in this 
study were based on the pNL4-3 sequence (GenBank ac- 
cession number AF324493) (65). The mutations denoted in 
Figure 1 were introduced as follows (nt positions based on 
AF324493): E77A (nts 2148 and 2150 changed from a to c; 
also introduces a BstUI diagnostic restriction site); E83A 
and E48A (nts 2166, 2168, 2169 and 2671 changed from 
a to c; also introduces a BstUI diagnostic restriction site); 
E90A and E91A (nts 2689, 2690 and 2692 changed from a 
to c; also introduces a BstUI diagnostic restriction site); and 
ElOOA, D103A and E105A (nts 2219, 2226, 2228 and 2234 
changed from a to c; also introduce BanI, Pvul and Styl di- 
agnostic restriction sites). Additionally, the mutant pNL4- 
3 -based proviral plasmid that maintains NC in its pi 5 form 
(a gift from Davit E. Ott, Frederick National Laboratory 
for Cancer Research) (43) was further mutated, changing 
PTAP to LIRE (Figure lA, amino acid positions 78-81) 
and/or FRFG to SRSG (Figure lA, amino acid positions 
86-89) to potentially disrupt TsglOl and Vpr interactions 
with p6, respectively (53,66). For the PTAP to LIRE mu- 
tant, nts 2153 and 2156 were changed from c to t, nt 2157 
was changed from a to t, nt 2158 was changed from g to c, nt 
2159 was changed from c to g and nt 2 1 62 was changed from 
c to t. For the FRFG to SRSG mutation, nts 2177 and 2183 
were changed from t to c. Mutations were introduced using 
conventional molecular biology methods and confirmed by 
sequence analysis. 

Unlabeled recombinant NC proteins were purified us- 
ing a modification of a previously-described protocol 
(19). Briefly, Escherichia coli (E. coli) BL21(DE3) pLysS 
was transformed with pET32a-based (EMD Millipore, 
Billerica, MA, USA) NC expression plasmids. The NC 
proteins were expressed as fusion proteins with an N- 
terminal thioredoxin (Trx) tag and a tobacco etch virus 
(TEV) protease cleavage site (ENLYFQG) between the 
Trx and NC regions that allowed for release of Trx 
and generation of the authentic NC proteins shown in 
Figure lA. After induction with 1 mM isopropyl (3- 
D-l-thiogalactopyranoside (IPTG) at room temperature 
overnight, cells were harvested and resuspended in 100- 
mM A^-cyclohexyl-3-aminopropanesulfonic acid buffer, pH 
10.4 at 4°C. Sonication was used to lyse the cefls and 2- 
mercaptoethanol (BME) was added to 1% final concentra- 
tion. The pH of the buffer was lowered to ~2 with 20% 
trifluoro acetic acid (TEA). The cell lysate, clarified by (i) 
ultracentrifugation at 51k x g in a Beckman (Beckman 
Coulter, Inc., Brea, CA, USA) Type 45 Ti rotor for 15 min 
at 4°C and (ii) filtering the resulting supernatant through 
a 0.45- jxm cellulose acetate filter (Corning Life Sciences, 
Tewksbury, MA, USA), was loaded onto a reverse phase 
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Figure 1. The HIV-1 NC proteins used in this study. (A) Sequence and structural features of NCp7, NCp9, NCpl5 and the three NCpl5 mutants (8 A, 5 A 
and C3A). Basic residues are in blue, acidic residues are in red and the ZFs are shown. (B) Three different forms of NC appear during HIV-1 maturation. 
The arrow heads indicate the cleavage sites that generate NCpl5, NCp9 and NCp7. 



high-performance hquid chromatography (rp-HPLC) CI 8 
column (Varian Microsorb 300-10, now Agilent, Santa 
Clara, CA, USA) and eluted with an acetonitrile gradient 
(10-90%) containing 0.1% TFA. The fractions correspond- 
ing to the desired product were collected and lyophilized. 
The fusion proteins were cleaved with 5-|jLg TEV protease 
per 100-|jLg protein (67,68) in a buffer containing 20-mM 
Tris-HCl, pH 7.4, 200-mM NaCl, 10-mM zinc acetate, 5- 
mM sodium citrate and 5-mM BME. The final cleaved 
NC product was further purified by rp-HPLC using a CI 8 
column as described above. The rp-HPLC fractions, cho- 
sen by matrix-assisted laser desorption/ionization-time of 
flight mass spectrometry (MALDI-TOF MS), were quanti- 
fied by amino acid analysis and combined with 1 . 1 equiv- 
alent of zinc-acetate per ZF then lyophilized and stored 
at — 80°C. Final purity was gauged by amino acid anal- 
ysis, amino acid sequencing and MALDI-TOF MS, and 
was found to be >95% pure (impurities were mainly NC 
proteins with minor N-terminal truncations). Chemically- 
synthesized NCp7 obtained from New England Peptide 
(Gardner, MA, USA) was used in some studies. It was re- 
folded with 1 . 1 equivalents of zinc ions per ZF in a buffer 
containing 20-mM HEPES, pH 7.5 and 0.1 -mM tris(2- 
carboxyethyl)phosphine (TCEP). The concentrations were 
determined by measuring the ultraviolet absorbance at 280 
nm using the following extinction coefficients: NCp7: 5690 
M-i cm-i; NCp9: 11 380 M-^ cm-^; NCpl5 and NCpl5 
mutants: 12 660 M"^ cm"^. 

^^N- and ^^C-labeled HIV-1 NC proteins were pre- 
pared under native conditions using a modification 
of a previously-described method (69). Briefly, E. coli 
BL21(DE3) pLysS was transformed with pET32a-based 
NCp7 expression plasmid (expressing Trx-NCp7) or 
pET3a-based (EMD Millipore) NCpl5 expression plas- 
mids. Bacteria were grown in minimal media supplemented 



with ^^N-NH4C1 and/or ^^C-Glucose as sole nitrogen 
and carbon source and were induced with 1-mM IPTG at 
room temperature overnight. Bacteria were harvested and 
resuspended in a buffer containing 50 mM Tris-HCl, pH 
7.4, 200-mM NaCl, 1-mM Zn(OAc)2, 10% glycerol and 
10-mM BME (loading buffer). After lysis of the bacteria 
by sonication, polyethylenimine (PEI) was added to a final 
concentration of 0.5% to precipitate NAs. The Trx-NCp7 
fusion protein was precipitated by 25% ammonium sulfate 
and cleaved by TEV protease as described above (67,68). 
After the TEV cleavage step (for NCp7) and after the PEI 
precipitation step (for NCpl5), proteins were purified on 
a HiTrap SP Sepharose FF 5 -ml column (GE Healthcare, 
Pataskala, OH, USA) using the above loading buffer and 
eluted with a linear NaCl gradient ranging from 200 mM 
to 1 M. The crude product from the SP column was further 
purified on a HiLoad Superdex 75 gel filtration column 
(GE Healthcare). The purified ^^N- or ^^N/^^C-labeled 
NCp7 and NCpl5 proteins were concentrated to ~0.5 mM 
using Amicon spin concentrators and dialyzed against 
degassed nuclear magnetic resonance (NMR) buffer (low 
sah: 25-mM NaOAc-d3, pH 6.5, 25-mM NaCl, 0.1-mM 
ZnCl2 and 0.1-mM BME; high salt: same as low salt except 
containing 500 mM NaCl). D2O (10% final) was added to 
the dialyzed proteins before NMR experiments. Gel-shift 
annealing assays were performed as described below to 
confirm that the NC proteins obtained using different 
methods were all active (Supplementary Figure SI). 

The peptide derived from HIV-1 CA helix 11, corre- 
sponding to residues 211-227, was purchased from Gen- 
Script (Piscataway, NJ, USA) and the p6 peptide, corre- 
sponding to residues 72-123 of NCpl5 (Figure lA), was 
synthesized by New England Peptide. These two peptides 
were purchased as HPLC-purified products and their iden- 
tity was confirmed by mass spectrometry. Polyglutamic acid 
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was purchased from Sigma-Aldrich (St. Louis, MO, USA) 
and used without further purification. 

Unmodified human tRNA^^^^ (hereafter referred to as 
human tRNA^^^^) and a 105-nt RNA construct that in- 
cludes the PBS of the HIV-1 genome (shortPBS) (70,71) 
were prepared by in vitro transcription (72). The RNA 
concentrations were determined using the following ex- 
tinction coefficients: tRNA^^'^: 664 700 M"^ cm"^ short- 
PBS: 935 700 M-i cm-i . 32p.labeled human tRNA^y^^ ^^^^ 
in annealing assays was prepared using the same method 
but with a-[^^P]-guanosine-5'-triphosphate (GTP) (Perkin 
Elmer, Shehon, CT, USA) instead of GTP 

Fluorescently-labeled shortPBS was prepared by la- 
beling purified RNA at the y end with fluorescein- 5- 
thiosemicarbazide as described (73,74). A 40-nt single- 
stranded (ss) deoxyribonucleic acid (DNA) used in salt 
titration experiments was purchased from Integrated 
DNA Technologies (Coralville, lA, USA): 5^-FAM-ACT 
GGA CCA AAT TCT AAT GTG TGT GTA CGA 
AGC ATA GCG G-3^ A 64-nt long DNA comple- 
mentary to the HIV-1 trans-activation response region 
(cTAR), labeled with 5^-AlexaFluor488 (AF) and 4-(4^- 
dimethylaminophenylazo)benzoic acid (DABCYL), was 
purchased from TriLink Biotechnologies (San Diego, CA, 
USA) (5^-AF-cTAR-DABCYL-30 (50). A singly-labeled 5^- 
AF-cTAR DNA was also obtained. 



Gel-shift annealing assays 

Time-course gel-shift annealing assays were performed 
as described (29,77). Briefly, 10-nM ^^P-labeled human 
tRNA^y^^ was mixed with 25-nM HIV-1 shortPBS in a 
buffer containing 50 mM HEPES pH 7.5, 5-mM DTT, 1- 
mM MgCl2 and 20-mM NaCl. The reaction was initiated 
by adding NC to a final concentration of 600 nM (un- 
less otherwise indicated). Aliquots were removed at dif- 
ferent time points and quenched with 1% sodium dode- 
cyl sulfate (SDS). Following two phenol-chloroform ex- 
tractions to remove proteins, reactions were run on SDS- 
polyacrylamide gel electrophoresis to separate the annealed 
product from the free tRNA^^^^ . The gels were analyzed on 
a Typhoon Trio Variable Mode Imager (GE Healthcare). 
Rate constants were calculated by fitting the data to a single- 
exponential equation. 

To determine the effect of specific peptides (p6, CA he- 
lix 11 or polyglutamic acid) on NCp7-facilitated anneal- 
ing, peptide concentration-dependent tRNA/shortPBS an- 
neahng assays were performed similar to the time-course 
assay. In these assays, 10-mM NaCl was used to promote 
NCp7-peptide interaction. The annealing reactions were 
performed in the presence of increasing concentrations of 
peptide (0-5 |jlM) and 800 nM of NCp7, at room temper- 
ature for 15 min prior to quenching with 1% SDS (NCp7 
and peptides were mixed first to allow complex formation). 



Fluorescence anisotropy binding experiments and salt titra- 
tion assays 

The fluorescence anisotropy (FA) binding experiments were 
carried out as described (75). Fluorescein-labeled shortPBS 
(19.2 nM) was incubated with various concentrations of NC 
(0-3 |xM) in 50 mM HEPES, pH 7.5, 5-mM dithiothre- 
itol (DTT), 1-mM MgCl2 and 20-mM NaCl. The reactions 
were incubated in the dark at room temperature for 30 min 
before measuring FA using a SpectraMax M5 plate reader 
(Molecular Devices, Sunnyvale, CA, USA). The data were 
analyzed as described, using a 1:1 binding model (75). 

Salt-titration binding assays were carried out as described 
(76). Briefly, a fixed amount of NC (100 nM or 400 nM) 
was incubated with 40-nt ssDNA (20-30 nM) in the pres- 
ence of increasing NaCl concentrations (0 mM to 1 M) in 
20-mM HEPES, pH 7.5, 20-|jlM TCEP, 5-mM BME and 
1-mM MgCl2. The reactions were incubated at room tem- 
perature in the dark for 30 min prior to FA measurements. 
To correct for the change of FA at high salt due to increased 
solution viscosity and/or DNA conformation changes, FA 
of control reactions in the absence of NC was also measured 
and the results were subtracted from the reactions with pro- 
teins. The data were analyzed as described (76). The disso- 
ciation constant, K^, varies with sodium ion concentration 
as follows: 

Kd = Kd^iM)-[Naf^\ (1) 

where Zgff is the effective charge of NC, which corresponds 
to the number of charges involved in direct NA interaction. 
K^d{\ M) denotes the dissociation constant of the protein in 
a buffer with 1-M NaCl when all electrostatic interactions 
have been screened out. 



Sedimentation assays 

Sedimentation assays were used to evaluate the aggregation 
capability of NC proteins (29). The RNA concentrations 
and buffer conditions were the same as in the time-course 
annealing assays, except the non-ionic detergent, Tween 20 
(Sigma), was added to 0.1% in some experiments. Various 
amounts of NC were added to the reactions, which were 
incubated at room temperature for 30 min to 2 h prior to 
centrifugation at 10 000 x g for 10 min. The radioactivity 
remaining in the supernatant was quantified by scintillation 
counting. The percentage of aggregated NA (y) versus pro- 
tein concentration (x) was fit to the Boltzmann equation: 



I -\- e dx 

Where Ai is set to 0 (the NA sedimentation before adding 
any proteins), A2 is the maximum percentage of NA aggre- 
gated, dx is the protein concentration change required for 
complete sedimentation and xq is the protein concentration 
at which half of the NA is sequestered by NC. 



Dynamic light scattering measurements 

Dynamic hght scattering (DLS) measurements were car- 
ried out in 50 mM HEPES pH 7.5, 5-mM DTT, 1.3-mM 
MgCl2 and 20-mM NaCl using 20-nM tRNA^y^^ 50. 
nM shortPBS. Measurements were performed 30 min af- 
ter 1.2-(xM NC addition on a Zetasizer Nano-ZS (Malvern 
Instruments Ltd, Worcestershire, UK) and analyzed by the 
Dispersion Technology Software provided. 
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TAR hairpin destabilization assays 

TAR hairpin destabilization experiments were performed 
similar to those previously described (29,50,78). Briefly, 
100 nM of 5'-AF-cTAR-DABCYL-3' was incubated with 
increasing concentrations of NC, in a buffer containing 
50 mM HEPES, 1-mM MgCl2, 100-mM NaCl, 10-|jlM 
TCEP and 5-mM BME. After a 30-min incubation at 
room temperature, fluorescence intensity was recorded on 
a SpectraMax M5 plate reader and fluorescence lifetime 
measurements were performed on a LifeSpec (red) time- 
resolved spectrometer (Edinburgh Instruments, Livingston, 
UK). The fluorescence lifetime data were fit to a triple- 
exponential decay model to obtain the fluorescence life- 
times (r) and populations of three states: B\, B2 and B3, 
corresponding to three distinct conformations (closed state: 
Ti ~ 0.2 ns, semi-open state: 12 ~ 1 ns and open state: T3 ~ 
3.5 ns) as described (78). Similar analysis for 5^-AF-cTAR 
was carried out and fitted to a two-states model. By taking 
into account the fluorescence intensity data, the population 
of a dark state (ao) can be estimated as follows (78): 



where <r>s and <r>exp are the mean fluorescence life- 
times for singly- and doubly-labeled species, respectively, 
and Fls and Flexp are the fluorescence intensities for singly- 
and doubly-labeled species, respectively. The value of <r> 
can be calculated by 

{T) = J2^i^i- (4) 

The dark state value (ao) was used to correct the values 
of the populations of hairpin (Bi) residing in the three fluo- 
rescent states as follows: 

ai = Bi(l-ao). (5) 
NMR data collection 

i^N/i^C-labeled NCp7 (0.4 mM) and NCpl5 (0.2 mM) in 
25-mM NaOAc-d3, pH 6.5, 25-mM NaCl, 0.1-mM ZnCls 
and 0.1-mM BME (low salt buffer) (or in high salt buffer: 
same as low salt except containing 500-mM NaCl) were 
placed into 5-mm NMR tubes. All spectra were collected 
at 25°C with a Bruker Avance 800-MHz (^H) NMR spec- 
trometer equipped with z-axis gradient cryogenic probe- 
heads. Backbone resonance assignments were made with 
standard heteronuclear triple resonance experiments (79). 
All data were processed with NMRPipe (80). Resonance as- 
signments were made with NMRView (81). The chemical 
shift differences were calculated based on 

^ ^ / (^H(NCp7) - 5h(NCp15))^ 2 ' (6) 
y (0.15 X (5n(NCp7) — ^N(NCpl5))) 

where 5h(ncp7) and <5h(ncp15) are the hydrogen chemical 
shifts and 5n(ncp7) and 5n(ncp15) are the nitrogen chemical 
shifts of NCp7 and NCpl5, respectively. The value 0.15 was 
a scaling factor used to correct for the larger range of nitro- 
gen chemical shift. 



Single-molecule DNA stretching studies 

We used dual beam optical tweezers to stretch single bacte- 
riophage X DNA molecules, labeled with biotin on both 5^ 
ends, as previously described (82). Stretching experiments 
were performed in 10-mM HEPES, 50 mM Na+, pH 7.5. 
After attachment of one DNA molecule, buffer was used 
to rinse out the other DNA molecules, and solutions con- 
taining specific protein concentrations were flowed around 
the DNA to investigate protein effects on DNA stretching 
curves. Both the DNA stretch and release force-extension 
curves in the presence of protein were then recorded at 
pulling rates in the range of 20-500 nm/s. 

Production of mutant and wild-type HIV-1 and infectivity 
analysis 

Mutant and wild-type (WT) proviral plasmids described 
above were transfected into 293T cells as described (7). Viri- 
ons were analyzed for single-round infectivity on TZM-bl 
cells (obtained through the NIH AIDS Research and Ref- 
erence Reagent Program, Division of AIDS, NIAID, NIH 
from Dr. John C. Kappes, Dr. Xiaoyun Wu and Tranzyme 
Inc.) as described previously (83). 

RESULTS 

NA binding properties 

The NA binding affinity of HIV- 1 NC proteins was stud- 
ied by FA. All three forms of NC bind the 105-nt shortPBS 
RNA sequence derived from the PBS region of the HIV-1 
genome (71) with very similar apparent values (Table 1). 
To further characterize and compare the NA binding prop- 
erties of NCp7, NCp9 and NCpl5, salt titration binding ex- 
periments were carried out (76). In these experiments, NC 
proteins were first incubated with a fluorescently-labeled 40- 
nt ssDNA and then NaCl was used to displace NC from 
the oHgonucleotide. ssDNA was used as a model system to 
avoid any complications of binding due to NA folding. The 
non-electrostatic binding component, m), of these three 
NC proteins is very similar, ~10~^ M (Table 1), as expected 
for the proteins sharing the same ZF structures. However, 
as expected based on differences in overall p/, slight differ- 
ences in the effective charge (Zeff) were observed. The Zgff of 
NCp7 is ~3.0, meaning that on average three Na^ cations 
are released from the NA phosphate upon protein binding. 
NCp9 possesses three more basic residues in the SP2 re- 
gion (Figure 1 A), but the effective charge only increases by 
1.3. Interestingly, the presence of acidic residues in the p6 
region (Figure lA) reduces the effective charge of NCpl5 
by 0.5 relative to NCp9. Thus, in the context of NCpl5 
the negative charges on the p6 domain may effectively neu- 
tralize some positive charges on NCp9. The effect of Zgff 
and Kd(iM) can be better visualized by calculating the de- 
pendence of the on salt (Figure 2). NCp7, NCp9 and 
NCpl5 show very similar binding affinity to the ssDNA at 
1-M NaCl. At lower salt concentrations, the differences be- 
tween these NC variants become significant. NCp9 binds 
with the highest affinity at low salt concentration consistent 
with the strongest electrostatic interactions, whereas NCp7 
is the weakest binder among the three (Figure 2). However, 
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Figure 2. Graph showing the dependence of dissociation constants (Kj) 
on NaCl concentration for NCp7, NCp9 and NCpl5. The binding was 
studied using a 40-nt ssDNA. 



in physiological salt of ~150 mM NaCl, the dissociation 
constants of all three NC proteins differ by only a few fold. 

NA annealing properties 

We next compared the NA annealing activities of NCp7, 
NCp9 and NCpl5 (29,71,77). In these assays, ^^P-labeled 
in vitro transcribed human tRNA^^^^ (76 nt) was incubated 
with shortPBS (105 nt) in the presence of different forms 
of HIV- 1 NC (Figure 1). Time-course annealing assays at 
600-nM protein concentration showed that NCpl5 can an- 
neal human tRNA^^^^ to the shortPBS at a similar rate as 
the mature NCp7 and the reactions reach similar extents 
of annealed product (Figure 3A and Table 2; gel examples 
are shown in Supplementary Figure S2). Surprisingly, HIV- 

1 NCp9 is the best annealing agent among the three forms 
of HIV- 1 NC. However, increasing the protein concentra- 
tion of NCp7 and NCpl5 to 2 (xM resulted in increased 
rates and extents of annealing that approached the level of 
NCp9 (Figure 3 A and Table 2). Immunoblots of samples 
taken from WT HIV-1 failed to detect NCpl5 in rapid har- 
vest (30 min) or 46-h virus (Supplementary Figure S3). In 
contrast, whereas 46-h virus contained only NCp7, rapid 
harvest virus particles contained a mixture of NCp7 and 
what appears to be NCp9 as the migration distance is sim- 
ilar to that of purified recombinant NCp9 (Supplementary 
Figure S3). Based on these findings in rapid harvest virus, 
preliminary annealing assays with mixtures of NC protein 
were also carried out. Adding NCp7 or NCpl5 (600 nM or 

2 (xM) to reactions containing 600-nM NCp9 does not neg- 
atively affect annealing (Supplementary Figure S4). nuUnull 
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Figure 3. Annealing of 10-nM human tRNALys3 to 25-nM HIV-1 short- 
PBS as a function of time in the presence of different forms of HIV-1 NC. 
(A) Comparison of the kinetics of annealing in the presence of the indicated 
concentration of NCp7, NCp9 and NCpl5. (B) Comparison of the an- 
nealing kinetics in the presence of 600-nM NCp9, WT NCpl5 and NCpl5 
acidic residue variants. (C) Effect on annealing of negatively charged pep- 
tides HIV-1 p6 and polyglutamic acid or neutral peptide CA helix 11. 



NA destabilization properties 

The NA chaperone function of retroviral NC proteins de- 
pends on two major properties — the capability of the pro- 
tein to destabilize stable duplex structures and to aggre- 
gate NAs (14,16,17,24). Fast NA interaction kinetics also 



plays a role in chaperone function (84). To further charac- 
terize the chaperone activity of NC, Forster resonance en- 
ergy transfer-based cTAR DNA hairpin destabilization as- 
says were adopted to evaluate HIV-1 NC's duplex destabi- 
lization activity (29,50,78). When NC interacts with cTAR, 
up to four populations of DNA can be detected [a 'dark 
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Table 1. Binding parameters measured for HIV-1 NCp7 and precursor proteins 





Kci (nUf 


^41 M) (M)b 


Zeff^ 


NCp7 


231 ±30 


(2.3±0.7)-10-5 


3.0 ±0.3 


NCp9 


240 ± 18 


(9.2±6.1)-10-5 


4.3 ± 0.7 


NCpl5 


278 ± 36 


(7.8±4.1)-10-5 


3.8 ±0.7 



^Determined from FA assays using 105-nt shortPBS in 50 mM HEPES, pH 7.5, 5-mM DTT, 1-mM MgCh and 20-mM NaCl. 
^Determined from salt titration assays using 40-nt ssDNA. K^(^{ m) and Zgff are defined as described in the text. 
Values are the average of at least three measurements with the standard deviation indicated. 



Table 2. Annealing rate constants of HIV-1 NC variants 


NC variant, concentration 


Final % annealing^ 


kohs (min 


Fold decrease^ 


NCp9, 600 nM 


93.3 ±0.5 


3.05 ±0.18 




NCp7, 600 nM 


50.4 ± 1.8 


0.08 ± 0.01 


38 


NCpl5, 600 nM 


66.6 ± 3.4 


0.14 ±0.02 


22 


NCpl5-5A, 600 nM 


94.8 ± 0.3 


2.38 ± 0.06 


1.3 


NCpl5-C3A, 600 nM 


91.2 ±0.4 


2.70 ±0.16 


1.1 


NCpl5-8A, 600 nM 


98.0 ±0.6 


1.48 ±0.14 


2.1 


NCp7, 2 |jlM 


95.5 ±0.1 


0.64 ± 0.02 


4.8 


NCpl5, 2 |jlM 


93.0 ±0.9 


0.34 ± 0.05 


9.0 



^ Final % annealing and /cobs were obtained by fitting annealing assay data to a single-exponential equation. 
^The fold decrease was calculated relative to the rate constant for NCp9 at 600 nM. 



State' (ao) and three more open states (ai, a2, as)]. The 
'dark state' is completely closed and becomes less populated 
with increasing binding of NC, while the open states be- 
come more populated as a result of NC's preferential bind- 
ing to ssDNA. Figure 4 shows a plot of ao (Figure 4A) and 
as, which is the most open state (Figure 4B), as a function 
of NC concentration. Based on these data, we conclude that 
the three forms of NC show comparable duplex destabiliza- 
tion activity. High concentrations of NCp9 induced aggre- 
gation of the DNA and thus the destabilization activity of 
NCp9 cannot be studied at the highest protein concentra- 
tions. (Supplementary Figure S5 summarizes the popula- 
tion shifts observed at the highest NC concentration used 
for all three forms of NC.) 

NA aggregation properties 

Sedimentation assays were performed to compare the 
NA aggregation capabilities of HIV-1 NCp7, NCp9 and 
NCpl5. In initial studies performed in the absence of the 
non-ionic detergent Tween 20, the same trend was observed 
as in the annealing assays (Figure 5A). NCp9 is the most 
effective agent, requiring the least amount of protein to ag- 
gregate half of the NA used in the reaction (C1/2 ~ 150 nM). 
NCpl5 and NCp7 appeared to show similar aggregation 
activity (C1/2 ~ 240 nM). However, in the absence of cen- 
trifugation, the presence of NCpl5 depleted NA in solution 
(data not shown), suggesting that the apparent sedimen- 
tation observed is due to adsorption of NCpl5-NA com- 
plexes to the sides of the microcentrifuge tubes used. When 
experiments were performed in the presence of 0. 1% Tween 
20, the effect was eliminated and NCpl5's aggregation ca- 
pability was measured as negligible (Figure 5B). Since the 
sedimentation assay does not assess aggregate morphology, 
DLS was used to study the aggregate size distribution of 
all three forms of NC. Interestingly, whereas NCp7 and 
NCp9 can form stable, large protein-RNA aggregates that 
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Figure 4. cTAR destabilization capability of HIV-1 NC. Changes in the 
population of the dark state (ao) (A) and the most opened state (a3) (B) of 
cTAR as a function of NC protein concentrations. 
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Figure 5. NA aggregating capabilities of different forms of HIV-1 NC. Results of sedimentation assays performed in the absence of Tween 20 (A) and in 
the presence of 0.1% Tween 20 (B), showing the percent of pelleted NA as a function of increasing protein concentration. (C) The NC-NA aggregate size 
distributions as determined by DLS for NCp7, NCp9 and NCpl5. (D) Effect of neutralizing the C-terminal acidic residues of NCpl5 on aggregate size 
distributions. 



scatter light effectively, the NCpl5-RNA complexes scat- 
ter light very weakly, resulting in small apparent sizes (Fig- 
ure 5C). In fact, RNA alone or NCpl5 alone show simi- 
lar calculated size distributions as NCpl5-RNA complexes 
(Supplementary Figure S6). Although the apparent aggre- 
gate size may not reflect the actual dimensions of NCpl5- 
NA complexes, these data strongly support the formation of 
very different complex morphology in the case of NCpl5. 
The calculated aggregate size distribution indicates that the 
sizes of complexes formed by all forms of NC vary, with 
NCp9 > NCp7 > > NCp 1 5 . 

Role of the p6 domain in regulating the chaperone function of 
NCpl5 

We hypothesize that the acidic nature of the p6 domain may 
play a role in distinct NA interaction properties of NCp 15. 
To test this hypothesis, several mutant forms of NCp 1 5 were 
generated containing three (C3A), five (5A) or eight (8A) 



E/D to A mutations (Figure lA). DLS analysis revealed 
that neutralizing five or eight acidic residues resulted in 
protein-NA aggregates that were similar in size distribution 
to those of NCp9 (Figure 5D). 

As shown in Figure 3B, neutralizing any number of acidic 
residues in p6 also restored annealing activity to the level 
of NCp9 (Table 2). Consistent with the negative impact of 
the acidic residues on annealing activity, a synthetic p6 pep- 
tide added to NCp7 in trans significantly inhibited NC's an- 
nealing activity of NCp7 (Figure 3C). A negatively charged 
polyglutamic acid inhibited NCp7 annealing activity even 
more strongly than p6, consistent with the higher negative 
charge density on the polyglutamic acid, while a neutral 
peptide derived from HIV-1 CAs CTD (85) does not have 
a significant effect (Figure 3C). Taken together, these data 
show that the acidic residues in p6 modulate both the ag- 
gregation and annealing properties of NCp 15. 

The NMR structure of the p6 domain alone shows that it 
is a largely unstructured peptide with two short helical seg- 
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ments forming in the presence of trifluoroethanol or in do- 
decylphosphocholine micelle solution (52,86). To determine 
whether p6 covalently appended to NCp9 results in a fold- 
back conformation as previously observed for HTLV- 1 NC 
(50,51), i^N-/^^C-labeled NCp7 and NCpl5 were prepared 
and backbone chemical shifts were determined with con- 
ventional triple resonance experiments. Figure 6A shows 
an overlay of the Heteronuclear Single Quantum Coherence 
(HSQC) spectra of NCp7 (red) and NCpl5 (black) recorded 
in 25-mM NaCl. While most signals corresponding to the 
NCp7 domain in the context of NCpl5 generally overlay 
very well with their counterparts in free-standing NCp7, 
some peaks are clearly perturbed (Figure 6A). Four residues 
(F16, A25, K33 and K38) in the ZFs show chemical shift 
differences greater than 0.05 ppm when comparing NCp7 
and NCpl5 (Figure 6B). These residues are close in space 
in the three-dimensional structure (69,87) (Figure 6C). In 
the presence of 500-mM NaCl, all of these chemical shift 
differences were significantly reduced in magnitude (Figure 
6B). Consistent with our NMR results, at higher salt (100- 
mM NaCl), the annealing rate differences observed between 
NCp9 and NCpl5 are reduced, and at 250-mM NaCl, the 
annealing rates of NCp9 and NCpl5, although both low, 
are identical (Supplementary Figure S7). 

Single-molecule DNA stretching studies 

We next used single-molecule DNA stretching (50,75,88,89) 
to probe the differences in NA interaction kinetics of HIV- 
1 NCp7, NCp9 and NCpl5. Typical DNA stretch-release 
cycles in the presence of 20 nM of each of the HIV-1 NC 
proteins performed at two different pulling rates (50 and 
500 nm/s) are presented in Supplementary Figure S8. All 
three proteins lengthen the DNA at high forces due to in- 
tercalation (90-92), while also destabilizing the DNA du- 
plex (88). As with all intercalators, increasing the force pro- 
motes additional binding and decreasing the force promotes 
protein dissociation. Therefore, the amount of NC bound 
is less than the equilibrium amount during stretching and 
more than the equilibrium amount during release, resulting 
in disagreement between the stretch and release curves, or 
hysteresis. Because the maximum hysteresis would be ob- 
tained for an infinitely slow protein that does not have time 
to bind during DNA stretching or to dissociate during re- 
lease, we normalize our measured hysteresis by this max- 
imum value to obtain the non-equilibrium component of 
NC binding. The hysteresis is quantified in Figure 7, which 
shows the non-equilibrium component of the DNA-bound 
NC molecules as a function of DNA stretching time, or the 
time it takes for a single stretch and release cycle at a given 
pulling rate. This non-equilibrium fraction represents the 
fraction of NC bound to DNA that exhibits binding kinet- 
ics slower than the time scale of stretching. For all measured 
stretching times, the non-equilibrium component is at least 
44% greater for NCpl5 relative to NCp7. 

The time behavior of this non-equilibrium DNA-bound 
NC component, F{i), exhibits multiple time scales, most 
likely due to the continuum of the sequence-dependent NC- 
DNA binding sites. F{t) can be minimally fit to a double 
exponential function 
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Figure 6. ^^n-Ir HSQC NMR analysis of NCp7 and NCpl5. (A) HSQC 
spectra collected in low salt (25-mM NaCl) NMR buffer for NCp7 (red) 
and NCpl5 (black). (B) Plot of the chemical shift differences between 
NCp7 and NCpl5 versus residue number for the ZF region at 25-mM 
NaCl (black) and 500-mM NaCl (dark cyan). The most significantly per- 
turbed residues are labeled. (C) Chemical shift differences between NCp7 
and NCpl5 observed at 25-mM NaCl shown on the NCp7 structure (pdb: 
lESK). Red regions indicate chemical shift changes greater than 0.05 ppm. 
Magenta regions indicate changes greater than 0.025 ppm but smaller than 
0.05 ppm. Zinc ions are shown as spheres. The backbone of NC is shown 
as ribbons. 



where /fast, /slow and Tfast, TsIow are the fast and the slow 
fractions of the non-equilibrium NC binding component 
and the corresponding relaxation times, respectively. The 
resulting parameters are given in Supplementary Table SI. 
The non-equilibrium components of the DNA-bound NC 
molecules have fast fractions varying from 0. 14 to 0.19, with 
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between p6 and TsglOl (53,93), while changing the FRFG 
motif at residues 86-89 of NCpl5 to SRSG should elim- 
inate p6-Vpr interaction (66). An HIV-1 clone with both 
motifs mutated was also generated. Single-round infectiv- 
ity assays were carried out using a TZM-bl cell line (83). If 
the interactions between NCpl5 and TsglOl or Vpr have 
negative effects on viral replication, eliminating these mo- 
tifs should rescue the viral infectivity defect observed in an 
HIV-1 variant expressing NCpl5. However, all of these mu- 
tations fail to show significant increases in infectivity com- 
pared to the virus expressing NCpl5 without the secondary 
changes (Table 3). Thus, these p6 interactions do not appear 
to be the major cause of the large decreases in viral infectiv- 
ity observed with NCpl5 containing HIV-1. 



Figure 7. Non-equilibrium component of DNA-bound HIV-1 NC 
molecules as a function of the DNA stretching time at 20-nM protein con- 
centration. The non-equilibrium component was obtained from the hys- 
teresis of the NC-DNA stretch-release cycle as a function of pulling rate 
normalized by the area between the force-extension curves for the protein- 
free dsDNA and protein-saturated DNA (89). The lines are fits to Equation 
(7) with the fit parameters summarized in Supplementary Table SI. 

relaxation times ranging from 25 to 47 s, and slow fractions 
with relaxation times in the range of 1000-5000 s. In addi- 
tion to having a larger non-equilibrium component for all 
stretching times, NCpl5 has a significantly larger slow frac- 
tion of -0.3, relative to -0.2 for NCp7 and NCp9. 

Cell-based assays 

Cell-based assays were carried out to determine the effect of 
neutralizing the acidic residues of p6 on viral infectivity. In 
the context of otherwise WT virus, the 8 D/E to A changes 
(see Figure 1) reduced viral infectivity by —20-fold in single- 
round TZB-bl infectivity assays [(83), Table 3], most likely 
because mutations disrupt other essential function of p6. 
Moreover, an HIV-1 mutant containing NCpl5-8A failed 
to rescue viral infectivity of the NCpl5-containing virus, 
which displayed only 1% infectivity compared to WT HIV- 
1; the infectivity of NCpl5-8A was —3-fold lower than that 
of NCpl5wT- However, it is apparent that these alterations 
did affect other properties of the mutant viruses. For exam- 
ple, in the case of the NCpl5-8A virus, an additional pro- 
tease cleavage site was introduced upon mutating the 8 D/E 
to A residues (Supplementary Figure S9); thus, the reduc- 
tion in infectivity may be due to these other factors and not 
necessarily elimination of ionic interactions between the p6 
domain and NC. This reinforces the importance of in vitro 
studies using recombinant proteins, since hypotheses are oc- 
casionally not directly demonstrable in the virus/cell setting 
due to unforeseen effects of mutations on viral processes. 
nuUnuU 

In vivo, the p6 domain has been reported to interact with 
many other cellular and viral proteins (52). These interac- 
tions may be detrimental to HIV-1 in the context of un- 
cleaved NCpl5. To test this hypothesis, two sets of muta- 
tions were designed in the context of NCpl5-containing 
virus to eliminate potential interactions with TsglOl and 
Vpr. Mutation of the PTAP motif at residues 78-81 of 
NCpl5 to LIRL was designed to disrupt the interaction 



DISCUSSION 

Previous cell-based assays showed that HIV-1 processing 
mutants that contain NCp9 resulted in variable effects on 
viral infectivity (43-48). These variations are likely due to 
the different mutations and HIV-1 strains used. In con- 
trast, processing mutations that result in NCpl 5-containing 
virus consistently led to large defects in viral infectivity and 
replication (43,45,46). Lending credence to the transient 
presence of some of these precursors in WT virus, we ob- 
served an a-NCp7-reactive band in an immunoblot of rapid 
harvest virus that migrates similarly to purified recombi- 
nant NCp9, along with NCp7 (Supplementary Figure S3), 
but disappears on longer incubation of virus. On the other 
hand, an NCpl 5 species was not detectable in rapid harvest 
or 46-h harvest HIV-1, in agreement with previous results 
reported by Kaplan et al (94). 

In this work, to understand the strict requirement for 
NCpl 5 processing for HIV-1 replication, we compared the 
NA binding and chaperone functions of the three major 
forms of NC, mature NCp7, NCp9 and NCpl 5. We ob- 
served only minor differences in these in vitro activities, con- 
sistent with the conclusion that an NCpl5-expressing virus 
was able to carry out reverse transcription (43,47). Consid- 
ering that the effective concentration of NC is quite high in 
virions (— mM), it is hkely that NCpl 5 can carry out RNA 
annealing efficiently during HIV-1 infection. NCp9 actually 
displayed the highest activity in all the assays conducted 
here (NA binding, anneahng, aggregation and destabiliza- 
tion). In a parallel study on the effects of the three NC pro- 
teins on HIV-1 reverse transcriptase reactions, Wu et al also 
find that under optimal conditions, NCp9 has the strongest 
chaperone activity (T. Wu et al , submitted for publication). 
Using a different assay system, it has been shown that al- 
though NCp9 is slower than NCp7 and NCpl 5 in stimulat- 
ing immature genomic RNA dimerization, it is capable of 
facilitating mature dimer formation, while NCpl 5 cannot 
(95). While the relative levels of chaperone activity may dif- 
fer, depending on the assay used, the in vitro activity differ- 
ences cannot fully explain the dramatic infectivity decreases 
observed for HIV-1 NCpl 5-containing virus. 

Sedimentation assays and DLS experiments showed that 
NCp7 and NCp9 both promote the formation of large 
protein-RNA aggregates. In contrast, NCpl5-RNA com- 
plexes could not be pelleted (in the presence of 0. 1% Tween 
20) and failed to scatter hght effectively by DLS (Figure 
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Table 3. Single-round infectivity analysis of WT and mutant HIV-1 


Virus 


Relative infectivity (% of WT)^ 


Relative infectivity (% of NCp 15 wt)^ 


(-)-ControF 


0 


0 


WT 


100 


NA 


NCplSwT 


0.96 ± 0.25 


100 


NCp 1 5(PTAP to LIRL) 


0.073 ± 0.052 


6.4 zt 4.3 


NCpl5(FRFG to SRSG) 


0.65 ± 0.41 


79d=9 


NCpl5(PTAP to LIRL/FRFG to SRSG) 


0.11 ±0.05 


10 ±4 


WT-8A^ 


6.2 ± 4.7 


NA 


NCp 1 5-8 


0.20 ± 0.01 


29 ± 1 



^Single-round TZB-bl infectivity assays were performed as described previously (83). Averages ± standard deviations of exogenous template reverse tran- 
scriptase (RT) activity-normalized titers are reported as % of WT HIV- 1 from two to four independent infection experiments. 

^Averages ± standard deviations of exogenous template RT activity-normalized titers are reported as % of NCpl5wT HIV-1 from two to four independent 

infection experiments. NA is not applicable. 

'^Infectivity of supernatants from mock-transfected 293T cells. 

^The 8A variants contain eight acidic residue changes to alanine in p6, as shown in Figure 1. 



5C). This is in accord with a previous electron microscopy 
study showing that NCp 15 binds ssDNA like 'beads on a 
string', unlike NCp7 or NCp9, which form condensed ag- 
gregates (96,97). 

The distinct NA interaction mode of NCp 15 is regulated 
by the acidic p6 domain. The results of NMR studies per- 
formed here provide support for a fold-back conformation 
in solution for NCp 15, wherein the acidic p6 domain inter- 
acts with the basic ZF domain. This interaction seems to be 
electrostatic, since at 500-mM NaCl the chemical shift per- 
turbations are weaker. Neutralizing the C-terminal acidic 
residues in p6 improves NCpl5's chaperone function (Fig- 
ure 3B), weakens the fold-back conformation and results in 
larger aggregates upon NA binding (Figure 5D). Adding p6 
peptide in trans also partially inhibits the annealing activity 
ofNCpT (Figure 3C). 

In the NA-bound state, similar inter-protein interac- 
tions between the p6 and ZF domains of neighboring 
NCp 15 molecules would lead to slower protein dissociation 
from NA. Indeed, single-molecule DNA stretching studies 
strongly support this idea. According to the observed DNA 
stretching hysteresis (Supplementary Figure S8), most of 
the DNA-bound NC molecules are in equilibrium on the 
time scales of 10-100 s, as previously observed (84). These 
DNA-bound NC molecules behave like mobile multivalent 
cations, optimizing their positions to maximize electrostatic 
self-attraction (98,99) and aggregate density (96,100-103). 
Compared to NCp7 and NCp9, NCp 15 has a larger over- 
all non-equilibrium fraction of DNA-bound NC molecules. 
It also has a larger fraction of DNA-bound NC molecules 
that dissociate on slow time scales of ~1000 s (Figure 7 
and Supplementary Table SI) and do not contribute to 
self-attraction and aggregation. Although the differences 
between the slow fractions of DNA-bound NCp7, NCp9 
and NCp 15 are moderate, a critical concentration of highly 
charged mobile cations is known to be required for NA ag- 
gregation (98). This condition appears to be satisfied for 
NCp7 and NCp9, but not for NCp 1 5 . The fold-back confor- 
mation observed when free in solution and the more open 
conformation when bound to NAs may also explain the re- 
quirement of NA binding for efficient HIV-1 protease cleav- 
age of NCpl5 at SP2/p6 (97,104). A variant of NCpl5 that 
is defective for binding RNA cannot be processed by HIV-1 
protease efficiently (105). 



Similar CTD/NTD interactions have been proposed for 
HTLV-1 NC (50,51). HTLV-1 NC is a poor NA chaperone, 
but deletion of its acidic CTD restores chaperone function 
(50,75). The presence of the CTD also greatly slows down 
the NA interaction kinetics of HTLV-1 NC (50,75). Thus, 
for both HTLV-1 NC and HIV-1 NCp 15, the acidic CTD 
negatively regulates chaperone function and NA binding ki- 
netics, although the effect in the case of NCp 15 is milder 
(50). Comparing the sequences of p6 in HIV-1 NC with the 
CTD of HTLV-1 NC (50,75) provides a plausible explana- 
tion for the much weaker effects of the former. It is well 
known that the interaction between macromolecules cor- 
relates with their charge density, but not their net charge 
(106,107). Indeed, the region of highest negative charge den- 
sity within the CTD of HTLV-1 NC (eight residues, 57-64) 
contains five anionic residues. In contrast, the highest nega- 
tive charge density region of p6 (15 residues, 77-91) has five 
negatively charged amino acids and one positive residue. 

Both NCp 15 and HTLV-1 NC bear some resemblance 
to the prototypical ssDNA binding protein of the bacterio- 
phage T4 gp32. gp32 has a highly negatively-charged CTD, 
which in the unbound state binds via intramolecular inter- 
actions to the protein's cationic ssDNA binding site, thereby 
slowing the NA binding on rate (108-1 10). Upon binding to 
ssDNA, gp32 forms a filament and binds in an intermolecu- 
lar fashion to neighboring gp32 molecules. The gp32-gp32 
interactions are hydrophobic in nature and are not associ- 
ated with the electrostatic CTD-ssDNA binding site inter- 
actions (111-113). Instead, in its ssDNA-bound form, the 
CTD of gp32 is solvent-exposed for additional interactions 
with regulatory proteins (114). 

Prior to viral assembly, NCp 15 is appended to Gag's C- 
terminus. If the NA interaction properties of NCp 15 in the 
context of Gag resemble those of the processed protein, the 
beads-on-a-string-type interaction with the HIV-1 genome 
may initially prevent NA aggregation and expose the cleav- 
age sites for processing (97,104). During maturation, the 
first cleavage happens between the MA-CA-SPl domain 
and the NCp7-SP2-p6 domain, thereby liberating NCp 15 
(39). It is possible that this form of NC is unable to effec- 
tively aggregrate the RNA, whereas the next cleavage sep- 
arates the p6 domain from NCp 15 leading to densely ag- 
gregated gRNA-NC complexes that may be more readily 
packaged into the viral capsid. Based on the observations 
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reported here, as well as previous work (96,97), the gRNA 
complex with unprocessed NCpl5 protein is expected to 
have a much larger volume, which would create a signifi- 
cantly higher osmotic pressure when packaged into mature 
HIV-1 capsid. In accord with this expectation, virions with 
NCpl5 processing defects show deformed capsid structures 
with significantly larger volumes than WT virions (46). This 
defect in mature capsid core formation appears to occur not 
only when NCpl5 cleavage is completely blocked but also 
when the timing of the cleavage is altered (46), leading to 
incomplete NC processing prior to mature capsid assembly 
(40). Indeed, variations in Gag processing rates in response 
to protease inhibitors lead to core structure and stabihty 
changes and major infectivity defects (115-117), which fur- 
ther highlights the requirement for NCpl5 processing prior 
to mature capsid formation (37,41). 

Neither disrupting p6-Tsgl01 nor p6-Vpr interaction 
could rescue viral infectivity of a SP2/p6 cleavage mutant 
(Table 3), showing that these interactions are hkely not the 
cause of non-infectious virus. The results of recent studies 
highlight the importance of a mature core of optimal sta- 
bihty on viral infectivity and suggest the presence of an in- 
tact capsid structure through at least a portion of the reverse 
transcription process [(118) and references therein]. More- 
over, variations in WT core stability have dramatic effects on 
reverse transcription (119), whereas inhibiting reverse tran- 
scription stabilizes the core (118,120). We propose that the 
NTD-CTD interaction present in NCpl5 disrupts this pro- 
tein's aggregation capabilities, thus leading to abnormally 
large volumes of NC-gRNA complexes, associated mature 
cores of lower stability and the observed detrimental effects 
on viral infectivity. 
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